Journal of Experimental Algorithmics 



Irredundant Intervals 

Donald E. Knuth 
Computer Science Department, Stanford University 



Abstract. This expository note presents simplifications of a theorem due to Gyori 
and an algorithm due to Franzblau and Kleitman: Given a family F of m intervals on a 
linearly ordered set of n elements, we can construct in 0(m + n) 2 steps an irredundant 
subfamily having maximum cardinality, as well as a generating family having minimum 
cardinality. The algorithm is of special interest because it solves a problem analogous 
to finding a maximum independent set, but on a class of objects that is more general 
than a matroid. This note is also a complete, runnable computer program, which 
can be used for experiments in conjunction with the public-domain software of The 
Stanford GraphBase. 
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1. Introduction. Let's say that a family of sets is irredundant if its members can be arranged in a 
sequence with the following property: Each set contains a point that isn't in any of the preceding sets. 

If F is a family of sets, we write F u for the family of all nonempty unions of elements of F. When F 
and G are families with F C G u , we say that G generates F. If F is irredundant and G generates F, we 
obviously have |_F| < \G\, because each set in the sequence requires a new generator. 

In the special case that the members of F are intervals of the real line, Andras Frank conjectured that 
the largest irredundant subfamily of F has the same cardinality as F's smallest generating family. This 
conjecture was proved by Ervin Gyori [4], who noted that such a result was a minimax theorem of a 
new type, apparently unrelated to any of the other well-known minimax theorems of graph theory and 
combinatorics. A constructive proof was found shortly afterwards by Franzblau and Klcitman [3], who 
sketched an algorithm to find a generating family and irredundant subfamily of equal cardinality. (Gyori, 
Franzblau, and Klcitman were led to these results while studying the more general problem of finding a 
minimum number of subrectangles that cover a given polygon. Further information about polygon covers 
appears in [3] and [1].) 

The purpose of this note is to describe the beautiful algorithm of Franzblau and Klcitman in full detail. 
Indeed, the CWEB source file that generated this document is a computer program that can be used in con- 
nection with the Stanford GraphBase [8] to find maximum irredundant subfamilies and minimum generating 
families of any given collection of intervals. Perhaps this new exposition will shed new light on the class of 
optimization problems for which an efficient algorithm exists. 

According to the conventions of CWEB [9], the sections of this document arc sequentially numbered 1, 2, 3, 
etc. In this respect we are returning to a style of exposition used by Euler and Gauss and their contemporaries. 
A CWEB program is also essentially a hypertext; therefore this document may also be regarded as experimental 
in another sense, as an attempt to find new forms of exposition appropriate to modern technology. 

Note: Gyori used the term "U-increasing" for an irredundant family; Franzblau and Klcitman called such 
intervals "independent." Since a family of sets is a hypergraph, it seems unwise to deviate from the standard 
meaning of independent edges, yet "U-increasing" is not an especially appealing alternative. We will see 
momentarily that the term "irredundant" is quite natural in theory and practice. 

2. A far-reaching generalization of Gyori's theorem was proved recently by Frank and Jordan [2], who 
introduced a large new family of minimax theorems related to linking systems. In particular, Frank and 
Jordan extended Gyori's results to intervals on a circle instead of a line. But no combinatorial algorithm is 
known as yet for the circular case. 

Can any or all of the Franzblau/Klcitman methods be "lifted" to such more general problems? We 
will return to this tantalizing question after becoming familiar with Franzblau and Kleitman's remarkable 
algorithm. 
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3. Theory. It is wise to study the theory underlying the Franzblau/Kleitman algorithm before getting 
into the program itself. 

4. A family of sets is called redundant if it is not irrcdundant. Any family that contains a redundant 
subfamily is redundant, since any family contained in an irredundant family is irredundant. 

5. If F is a family of sets and s is an arbitrary set, let F\s denote the sets of F that are contained in s. 
This operation is left-associative by convention: F\s\t = (F\s)\t — F\(sD t). 

We also write [j F for (J{/ I / € F}; thus F\\JF = F. 

(An index to all the main notations and definitions that we will use appears at the end of this note.) 

6. Lemma. A finite family F is redundant if and only if there is a nonempty set s such that every point 
of s belongs to at least two members of F\s. (The set s need not belong to F.) 

Proof: If F is irredundant there is no such s, because F\s is irredundant and its last set in the assumed 
sequence contains a point that isn't in any of the others. But if F is redundant, it contains a minimal 
redundant subfamily F : then we have 

/C[J(Fo\ {/}) for all f e F , 

since Fq \ {/} is irredundant. It follows that every point of s = [JFo is contained in at least two members 
of i<b, hence in at least two members of F\s (since Fn C F\s). 

7. Corollary. A hnite family F of intervals on a line is redundant if and only if there is an interval s such 
that every point of s belongs to at least two intervals of F\s. (The set s need not belong to F.) 

Proof: Intervals are nonempty. By the proof of the preceding lemma, it suffices to consider sets s that can 
be written (J Fq for some minimal redundant subfamily Fo . In the special case of intervals, (J Fo must be a 
single interval; otherwise F would not be minimal. 

8. Henceforth we will restrict consideration to finite families F of intervals on a linearly ordered set. It 
suffices, in fact, to deal with integer elements; we will consider subintervals of the n-elemcnt set [0 . . n). (The 
notation [a . . b) stands here for the set of all integers x such that a < x < b.) 

If F is a family of sets and a; is a point, we will write N x F for the number of sets that contain x. The 
corollary just proved can therefore be stated as follows: "F is irredundant if and only if every interval 
s C (J F contains a point x with N x F\s < 1." This characterization provides a polynomial-time test for 
irredundancy. 

9. Irrcdundant intervals have an interesting connection to the familiar computer-science concept of binary 
search trees (see, for example, [7, §6.2.2]): A family of intervals is irrcdundant if and only if we can associate 
its intervals with a binary tree whose nodes are each labeled with an integer x and an interval containing x. 
All nodes in the left subtree of such a node correspond to intervals that are strictly less than x, in the sense 
that all elements of those intervals are < x; all nodes in the right subtree correspond to intervals that arc 
strictly greater than x. The root of the binary tree corresponds to the interval that is last in the assumed 
irredundant ordering. Its distinguished integer x is an element that appears in no other interval. 

Given such a tree, we obtain a suitable irredundant ordering by traversing it recursively from the leaves 
to the root, in postorder [6, §2.3.1]. Conversely, given an irredundant ordering, we can construct a binary 
tree recursively, proceeding from the root to the leaves. 
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10. An example might be helpful at this point. Suppose n — 9 and 

/i = [0..8), / 2 = [0..7), / 3 = [1..6), / 4 = [1..5), / B = [3..9), / 6 = [2..9). 

Then {fi, f2, f3, fi, fs} and /3, /s, /s} are irredundant. (Indeed, a family of intervals is irredundant 
whenever its members have no repeated left endpoints or no repeated right endpoints.) These subfamilies 
are in fact maximally irredundant — they become redundant when any other interval of the family is added. 
Therefore maximal irredundant subfamilies need not have the same cardinality; irredundant subfamilies do 
not form the independent sets of a matroid. 

On the other hand, irredundant sets of intervals do have matroid-like properties. For example, if F is 
irredundant and F U {g} is redundant, there is an / ^ g such that F U {g} \ {/} is irredundant. (The proof 
is by induction on \F\: There is x £ / G F such that F — Fi U {/} U F r , where Fi and F r correspond to 
the left and right subtrees of the root in the binary tree representation. If x G g, the family Fi U {g} U F r is 
irredundant; if x < g, there is /' G F r with Fi U {/} U (F r U {g} \ {/'}) irredundant, by induction: if x > g 
there is /' G Fi with (Fi U {g} \ {/'}) U {/} U F r irredundant.) Such near-matroid behavior makes families 
of intervals especially instructive. 

11. Let's say that an interval s is good for F if N x F\s < 1 for some x G s; otherwise s is bad. Franzblau 
and Kleitman introduced a basic reduction procedure for any family F of intervals that possesses a bad 
interval s. Their procedure is analogous to modification along an augmenting path in other combinatorial 
algorithms. 

Let [ai . . b\), . . . , [ofc . . bk) be the maximal intervals in F\s, ordered so that a\ < ■ ■ ■ < and b\ < ■ ■ ■ < bk- 
(Notice that s = [ai . . bk)-) For example, we might have the following picture: 

s 



di • o b\ 

a2 • o 6 2 

a 3 • o 6 3 

04 • o 64 

dfc • 

• o (non-maximal intervals make s bad) • 

If a,j + i < bj for 1 < j < k, the family F reduced in s is defined to be 

F[s = F \ {[ai . . 61), . . . , [ofc . . b k )} U {[a 2 . . 61), . . . , [a fe . . b k -i)} ■ 

In the simplest case we have k = 1 and the reduced family is simply F \ {[a\ . . b\)}. 

If s is a minimal bad interval for F, we can prove that aj + i < bj for 1 < j < k, hence Fjs is well-defined. 
Indeed, point Oj+i must be in some interval [c. . d) other than [dj+\ ■ ■ bj+i), since s is bad. We can assume 
that c < Oj+i; otherwise all intervals of F|s would be contained in [a\ . .dj+i) or [flj+i . .afe), and both of 
these subintervals would be bad, contradicting the minimality of s. But c < aj + \ implies that [c..d) is 
contained in some maximal [aj . . 6j) with i < j. Hence cij+i < bi < bj. 

The notation FJ.S is defined to be left-associative, like F\s; that is, F|s|i = {Fls)it and |t = (Fls)\t. 

12. Lemma. Jf s is a minimal bad interval for F, we have F C (i^|s) u . 

Proof: We must show that [dj . .bj) G (F|s) u for 1 < j < k. Let G = F[s\[aj . .bj); we will prove 
that [a,j..bj) — [JG. If x £ [aj..bj), the badness of s implies that x £ t for some t £ F\s with 
t 7^ [a.j . . bj); let t be contained in the maximal interval [ai . . bi). If i = j, we have t £ G; if i < j, we 
have x £ [a,j . .bi) C [aj . . &j-i) G G; and if i > j, we have x £ [en . . bj) C [aj+i . . bj) £ G. 



o 6 fe 
o 
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13. Lemma. Suppose s is a minimal bad interval for F, while t is a good interval. Then t is good also 
for Fls. 

Proof: Let s — [a. .b),t= [c . . d), I = min(a, c), r = max(6, d). Suppose N x F\ [I . . d) > 2 and N x F\[c..r) > 
2 for all x 6 t. Then l = a<c<d<b = r, because t is good for F. By the minimality of s, there is some 
x G [a . .d) with N x F\ [a . . d) -x < 1. Since N x F\ [a . . d) > 2 we have x < c. Furthermore, x is in some interval 
of (F\s) \F\[a. . d), because s is bad; so x is in some maximal [a,j . . bj) with a<aj<x<c<d<bj<b. It 
follows that none of the intervals [a\ . . . . . , [a k . . b k ), [a 2 . . b\), . . . , [a k . . b k -i) are contained in t, hence 
Fls\t = F\t. 

On the other hand, suppose N x F\ [I . . d) < 1 for some x 6 t. We will show that N x F\,s\t < 1. This 
assertion can fail only if x lies in some interval [aj+i . . bj) C t newly added to Fls. Then x £ [a,j . .bj) C 
[I . . d), and x G [cij+i ■ ■ bj+i) ^ [I . . d), hence bj < d < it follows that j is uniquely determined, and the 
only interval containing x in F! s \ t is [aj+i . . bj). A similar argument applies if N x F\ [c . . r) < 1. 

14. Corollary. If s is a minimal bad interval for F, we have N x F[s = N x F — N x s. 

Proof: The proof of the preceding lemma shows in particular that none of the intervals [cij+i ■ - bj) are already 
present in F before the reduction. And if x € s, suppose x lies in [oj . . bi), [aj+i . . . . . , [dj ■ ■ bj); then 

it lies in [a i+1 . . bi), . . . , [aj . . bj-i) after reduction, a net change of —1. 

15. The Franzblau/Kleitman algorithm has a very simple outline: We let Go — F and repeatedly set 
Gfe + i = G k \,s kl where s k is the leftmost minimal bad interval for G k , until we finally reach a family G r 
in which no bad intervals remain. This must happen sooner or later, because \G k \ = \F\ — k. The final 
irredundant family G = G r generates F, because F C G k for all k by the lemma of §12. Franzblau and 
Kleitman proved the nontrivial fact that \G\ is the size of the maximum irredundant subfamily of F; hence 
G is a minimum generating family. 

16. It is tempting to try to prove the optimality of G by a simpler, inductive approach in which we "grow" 
F one interval at a time, updating its maximum irredundant set and minimum generating set appropriately. 
But experiments show that the maximum irredundant set can change drastically when F receives a single 
new interval, so this direct primal-dual approach seems doomed to failure. The indirect approach is more 
difficult to prove, but no more difficult to program. So we will proceed to develop further properties of 
Franzblau and Kleitman's reduction procedure [3]. The key fact is a remarkable theorem that we turn 
to next. 

17. Theorem. The same final family G = G r is obtained when s k is chosen to be an arbitrary (not 
necessarily leftmost ) minimal bad interval of G k in the reduction algorithm. Moreover, the same multiset 
{so, . . . , s r _i} of minimal bad intervals arises, in some order, regardless of the choices made at each step. 
Proof: We use induction on r, the maximum number of steps to convergence among all reduction procedures 
that begin with a family F. If r = 0, the result is trivial, and if F has only one minimal bad interval the 
result is immediate by induction. Suppose therefore that s and t are distinct minimal bad intervals of F. We 
will prove later that t is a minimal bad interval for F[s, and that F[s[t = F[t[s. Let r' be the maximum 
distance to convergence from Fls, and r" the maximum from Fit; then r' and r" are less than r, and 
induction proves that the final result from F[s is the final result from F[slt = Fltls, which is the final 
result from Fit. (Readers familiar with other reduction algorithms, like that of [5], will recognize this as a 
familiar "diamond lemma" argument. We construct a diamond-shaped diagram with four vertices: F, Fls, 
Fit, and a common outcome of Fls and Fit.) This completes the proof, except for two lemmas that will 
be demonstrated below; their proofs have been deferred so that we could motivate them first. 
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18. This theorem and the lemma of §13 have an important corollary: Let S — {so, ■ ■ ■ , s r -\} be the 
multiset of minimal bad intervals determined by the algorithm from F, and let t be any interval. Then S\t is 
the multiset of minimal bad intervals determined by the algorithm from F\t. This holds because an interval 
s C t is bad for F if and only if it is bad for F\t. Minimal bad intervals within t never appear again once 
they are removed, and we can remove them first. 

Reducing a minimal bad interval s when s is contained in a bad interval t may make t good, or leave it 
bad, or make it minimally bad. If s is minimally bad for F, it might also be minimally bad for Fls. 

19. Now we are ready for the coup de grace and the piece de resistance: After the reduction algorithm has 
computed the irrcdundant generating family G = G r and the multiset S of minimal bad intervals, we can 
construct an irredundant subfamily F' of F with = \G\ by constructing a binary search tree as described 
in §9. The procedure is recursive, starting with an initial interval t = [0 . . n) that contains F: The tree 
defined for F\t is empty if F\t is empty. Otherwise it has a root node labeled with x and with any interval 
of F\t containing x, where x is an integer such that N X G^ = 1; here G^ is the final generating set that 
is obtained when the reduction procedure is applied to F\t. A suitable interval containing x exists, because 
every element of G^ is an intersection of intervals in F\t. The left subtree of the root node is the binary 
search tree for F\ (t fl [0 . . x)) ; the right subtree is the binary search tree for F\ (t fl [x + 1 . . n)) . 

The number of nodes in this tree is \G\. For if x is the integer in the label of the root, G has one interval 
containing x, and its other intervals are G\ [0 . . x) and G\ [x + 1 . . n). The family <?W is not the same as G\t; 
but we do have \G^ \ = \G\t\ when t has the special form [0 . . x) or [x + 1 . . n), because in such cases Fls \t 
has the same cardinality as F\t when s is a minimal bad interval and s %t. For example, if t = [0 . . x) and 
bj <x < bj + i, we have F[s \t = F\t \ {[ai . . bi), . . . ,[aj . . bj)} U {[a 2 . . &i), . . . , [a^+i . . bj)}. 

20. It is not necessary to compute each from scratch by starting with F\t and applying the reduction 
algorithm until it converges, because the binary tree construction algorithm requires only a knowledge of 
the incidence function N x G^K This function is easy to compute, because N x Fls — N x F — N x s by §14; 
therefore 

N x G®=N x F\t-N x S\t. 

21. All the basic ideas of Franzblau and Klcitman's algorithm have now been explained. But we must 
still carry out a careful analysis of some fine points of reduction that were claimed in the proof of the main 
theorem. If s and t are distinct minimal bad intervals, the lemma of §13 implies that no bad subintervals 
of t appear in F[s; we also need to verify that t itself remains bad. 

Lemma. If s is a minimal bad interval for F and t is a bad interval such that s %t, then t is bad for Fls. 

Proof: Let s = [a . . b) and t = [c . . d) . We can assume by left-right symmetry that a < c. Then b < d, 
by minimality of s. Assume that t isn't bad for Fls. The subfamily F\t must contain at least one of the 
maximal intervals [aj . . bj) of F\s that are deleted during the reduction; hence c < aj < bj_i < b. 
Let j be minimal with aj > c. Then 

Fls \t = F\t \ {[aj . . bj), . . . , [o fc . . b k )} U {[aj . . bj-!), . . . ,[a k . . b^)} ; 

so the elements of t that are covered once less often are the elements of \bj-i ■ ■ bk). Suppose y E [bi-i . . bi) for 
some i > j. Then y G [aj . . bi) C sCit, and y is in some other interval / C s. The maximal interval containing 
/ must be [a; . . bi) for some I > i, hence / C s fl t. Thus N y F\ (s D t) > 2 for all y G [bj-i . . bk). But s P\t 
is good, so there must be a point x e [c . . bj-\) with N x F\ (sflt) < 1. We also have N x F\t > 2, since t is 
bad, so there's an interval in F\t \ F\(s fl t) that contains x. This interval contains [x . . bk). Consequently 
N v F\t > 3 for all y e [bj-! . . b k ). 
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22. Lemma. If s and t are minimal bad intervals for F and s j^t, we have Fjsjt = Fj.tj.s. 

Proof: Let the maximal intervals in F\s and F\t be [ai..b\), [ak . .bk) and [c\..d\), [c/..d;), 
respectively, where a\ < c\. The lemma is obvious unless F\(s D t) is nonempty, so we assume that 
ci < 6fc < d;. Let x G s n t have F| (sflf) = 1, and let / be the interval of F\ (s n t) that contains x. 
Let p be maximal with a p < c±, and let q be minimal with d q > b k . Since N x F\s > 1, there is an interval 
[aj . . bj) containing x with j < p; thus x G [a p . . b p ). Similarly x G [c g . . d q ). Furthermore, if p < k we have 



,j p+i) C s n t; hence either [a p+ i . . 6 p +i) 



/ or a p+1 > x. If q > 1 we have either [c, 



q— 1 • • dq— 1/ 



or d q -i < x. 

If p = k or / ^ [a p+ i 



. 6 p+ i), any newly added intervals [a^+i . for p < j < k in FJ,s are properly 



contained in [c 9 . . ci 9 ), so they remain in FJ.sJ.t. Thus we can easily describe the compound operation F|s|t 
in detail: 

Delete [<n . . h), [a k . . b k ); insert [a 2 . . . . . , [a k . . b k -i); 

delete [ci . . di), . . . , [q . . dj); insert [c 2 . . di), . . . , [cj . . dj-i). 

No two of these intervals are identical, so F\,tis gives the same result. (If / = [c g _i . . d g _i), the family Fit 
has / replaced by [c q . . d q -\) C / C [a, . . 6j) for some i < p, so [c g . . d 9 _i) is not maximal in Fit s.) 

The remaining case / = [a p +i . . & p +i) = [c 9 _i..d ? _i) needs to be considered specially, since we can't 
delete this interval twice. The following picture might help clarify the situation: 



ai 



-o 6i 



a 2 • 



( a p+i 



ci 



Cg-l 



-o di 



-o 6 2 



-o b p+ i same as [c ? _i . . d 9 _i) below ) 
o 6 fe 



-o d„_i 



-o d 



-O d; 



Suppose g is an interval of F\t that is contained in [cj . . dj) if and only if j 



1. Then g contains a 



point < c . If a; G g we have g = /, since g C s n t. Otherwise g C [c _i . . x) C [c -i . . 6 P ) = [a p +i . . 6 P ). It 
follows that the maximal intervals of Fl s 1 1 are 

[ci..di), [c 9 _ 2 . . d _ 2 ), [c g _i..6p), [c q ..d q ), [cj..dj). 

These intervals are replaced in Fjsjt by 

[C 2 ..dl), [Cg_l . . dg_ 2 ), [Cg..6 p ), [Cg+l-.dg), . . ., [C( . . rfj _ 1 ) - 

Thus i^jslf is formed almost as in the previous case, but with [a p +i . . b p ) and [c 9 ..d q _i) replaced by 
[c q . .bp). And we get precisely the same intervals in Fit Is. 
(Is there a simpler proof?) 
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23. Practice. The computer program in the remainder of this note operates on a family of intervals 
defined by a graph onn + 1 vertices {0,1, ... ,n}. We regard an edge between u and v as the half-open 
interval [it . . v), when u < v. 

Graphs are represented as in the algorithms of the Stanford GraphBase [8] , and the reader of this program 
is supposed to be familiar with the elementary conventions of that system. 

The program reads two command-line parameters, m and n, and an optional third parameter representing a 
random-number seed. (The seed value is zero by default.) The Franzblau/Kleitman algorithm is then applied 
to the graph random_graph(n + 1, m, 0, 0, 0, 0, 0, 0, 0, seed), a random graph with vertices {0, 1, . . . , n} and 
m edges. Alternatively, the user can specify an arbitrary graph as input by typing the single command-line 
parameter -g ( filename } ; in this case the named file should describe the graph in save^graph format (as in 
the MILES_SPAN program of [8]). 

When the computation is finished, a minimal generating family and a maximal irrcdundant subfamily will 
be printed on the standard output file. 

If a negative value is given for n, the random graph is reversed from left to right; each interval [a . . b) 
is essentially replaced by [—b..— a) (but minus signs are suppressed in the output). This feature lends 
credibility to the correctness of our highly asymmetric algorithm and program, because we can verify the 
fact that the minimum generating family of the mirror image of F is indeed the mirror image of F's minimum 
generating family. 

In practice, the algorithm tends to be interesting only when m and n are roughly equal. If n is large 
compared to m, we can remove any vertices of degree zero; such vertices aren't the endpoint of any interval. 
If m is large compared to n, we can almost always find n irrcdundant intervals by inspection. The running 
time in general is readily seen to be 0(mn + n 2 ). 

#define panic(k) 
{ 

fprintf (stderr , " Oops , u we ' re u out u of u memory ! u (Case u 7,d) \n" , k) ; 
return k; 

} 

^include "gb_graph.h" /* the GraphBase data structures */ 
^include "gb_rand.h" /* the random_graph generator */ 
^include "gb_save.h" /* the restore-graph generator */ 

( Preprocessor definitions ) 

Graph *F; /* the graph that defines intervals */ 
Graph *G; /* a graph of intervals that generate F */ 

( Subroutines 33 ) 

main(argc, argv) 

int argc; /* the number of command- line arguments, plus 1 */ 

char *argv{]; /* the command-line arguments */ 

{ 

register Vertex *t, *u, *v, *w, *x; /* current vertices of interest */ 

register Arc *a, *b, *c; /* current arcs of interest */ 

( Scan the command- line options and generate F 24 ) ; 
(Compute G and S by the Franzblau/Kleitman algorithm 26); 
if (gb-trouble-code) panic (1); 

( Construct an irredundant subfamily of F with the cardinality of G 32 } ; 
( Print the results 36 ) ; 

return 0; /* this is the normal exit */ 

} 
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24. ( Scan the command-line options and generate F 24 ) = 
{ _ 

int m = 0, n = 0, seed = 0; 

if (argc > 3 A sscanf (argv [1], "7.d",&m) = 1 A sscanf(argv [2], "7.d",&n) = 1) { 
if (argc > 3) sscan/(ari7v[3], "7.d", &seed); 
if (to < 0) { 

to = —to; /* we assume the user meant to negate n instead of m */ 

n = —n; 

} 

if (n > 0) F = random_graph(n + 1, to, 0, 0, 0, 0, 0, 0, 0, seed); 
else { 

G = random-graph (— n + 1, to, 0, 0, 0, 0, 0, 0, 0, seed); 
{ Set F to the mirror image of G 25 } ; 
gb_recycle(G); 

} 

} 

else if (argc = 2 A strncmp(argv [1], "-g", 2) = 0) F = restore_graph(argv[l] + 2); 
else { 

fprintf (stderr , "Usage : u /.s u m u n u [seed] uu I U u7oS u -gfoo . gb\n", argv[0], argv[0]); 
return 2; 

} 

if (-F) { 

fprintf (stderr , "Sorry , u can't u create u the u graph! u (err or u code u °/,ld)\n", pamc_co<ie); 
return 3; 

} 

} 

printf(" Apply ing u Franzblau/Kleitman u to u / s : \n", F^id); 
This code is used in section 23. 

25. ( Set F to the mirror image of G 25 ) = 
F = gb-neui-graph(G-m); 

if (~F) panic (4); 

make-CompoundJd(F, "reflect (", G, ")"); 

for = G-'vertices , u = F^vertices + F^n — 1; v < G^vertices + G^n; u — , v++) { 
v^clone = u; 

u-^name = gb_save_string (v^name); 

} 

for (v = G^vertices; v < G^vertices + G^n; v++) 

for (a — v^arcs; a; a = a-*next) gb_new_arc (v- clone, a-^tip^ clone, 1); 
This code is used in section 24. 
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26. The main algorithm. We follow the outline of §15. 

(Compute G and S by the Franzblau/Kleitman algorithm 26} = 
(Make G a copy of F, retaining only leftward arcs 27); 
for (v — G^vertices + 2; v < G^vertices + G^n; v++) 

( Reduce all minimal bad intervals with right endpoint v and record them in S 28 } ; 

This code is used in section 23. 

27. The algorithm doesn't need pointers from the left endpoint of an interval to the right endpoint; leftward 
pointers are sufficient. (This observation makes the reduction procedure faster.) 

While copying F, we remove its rightward arcs, and we assign length 1 to all its leftward arcs. Later on, 
we will represent intervals of S by recording them in F as leftward arcs of length — 1. 

We also clear two utility fields of F's vertices, since they will be used by the algorithm later. (They might 
be nonzero, if F was supplied with the -g option.) 

#define clone u.V /* the vertex in F that matches a vertex in G, or vice versa */ 

( Make G a copy of F, retaining only leftward arcs 27 } = 

switch-to -graph (A); /* prepare to return to graph F later */ 

G = gb-neui-graph (F-*n) ; /* a graph with nameless vertices and no arcs * / 

if (-iG) panic (5); 

for (u = F^vertices ,v = G^vertices; u < F^vertices + F^n; u++,v++) { 
undone = v; v-'done = u; 
u-y.I = u-z.I = 0; 
v^name = gbsavestring(u-'name); 
for (a = u-^arcs,b = A; a; a = a-^next) 

if (a-tip > u) { /* we will remove the non- leftward arc a */ 
if (b) b^next = a-^next; 
else u-^arcs = a-^next; 
} else { /* we will copy the leftward arc a */ 

gb-new-arc(v,a-'tip-'clone,Q); /* the length in G is */ 

aAen = 1; /* but in F the length is 1 * / 

b = a; /* b points to the last non-removed arc */ 

} 

} 

if (gb-trouble-code) panic (6); 

switch-to -graph (F); /* now we can add arcs to F again */ 

This code is used in section 26. 
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28. Here's the most interesting part of the program, algorithmwise. Given a vertex v, we want to find the 
largest u, if any, such that [u . . v) is bad for the intervals in G. So we sweep through the intervals [u . . v) 
from right to left, decreasing u until it reaches a limiting value t. Here t is the least upper bound on a left 
endpoint that could guarantee double coverage of all points in [u. .v). 

The utility field x^count records the number of intervals with left endpoint x and right endpoint w in the 
range u < w < v. Another utility field xAink is used to link vertices with nonzero counts together so that 
we can clear the counts to zero again afterwards. 

It's easy to see that each iteration of the while loop in this section takes at most 0(m + n) steps. The 
actual computation time is, however, usually much faster. 

This program is designed to work correctly when G contains more than one arc from u to v. Duplicate 
arcs are discarded as a special case of the reduction procedure. 

#define count z.I /* coverage decreases by this much when we pass to the left */ 
#define link y.V /* pointer to a vertex whose count field needs to be zeroed later */ 

( Reduce all minimal bad intervals with right endpoint v and record them in S 28 } = 
{ 

while (1) { 

int coverage = 0; /* the number of intervals C [t . .v) that contain u */ 
int potential = 0; /* sum of x^count for x < t */ 

Vertex * cleanup = A; /* head of the list of vertices with nonzero count */ 

for (u = v, t = v — 1; u > t; u — ) { 

coverage —= u-^count; /* we prepare to decrease u */ 
( Update the counts for all intervals ending at u 29 } ; 

if {coverage + potential < 2) { /* there's no bad interval ending at v */ 
( Clean up all count fields 30 } ; 
goto donejwithjv ; 

} 

while {coverage < 2) { 

t— ; 

coverage += t-^ count; 
potential —= t-^count; 

} 

} 

gb_new-arc{v->clone , w clone , —1); /* [u..v) is minimally bad; we record it in S = —F */ 
( Replace G by G[ [u . . v) 31 ) ; 

(Clean up all count fields 30 }: /* now we'll try again */ 

} 

donejwithjv : ; 
} 

This code is used in section 26. 
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29. ( Update the counts for all intervals ending at u 29 ) = 
for (a = u-^arcs; a; a — a-*next) { 

w = a-tip; 

if (w^count = 0) { 

w-'link = cleanup; 

cleanup — w; 

} 

w^count++; 

if (w > t) coverage ++; 

else potential ++; 

} 

This code is used in section 28. 

30. { Clean up all count fields 30 ) = 

for (w = cleanup; w; w — w^link) w^count = 0; 
This code is used in section 28. 

31. The reduction process is kind of cute too. 

( Replace G by G[ [u . . v) 31 ) = 

for (a = v^arcs, c = A, w = v; a; c = a, a = a-^next) 
if (a-^tip > u A a-^tip < w) w = a-^tip 7 b — c; 

/* now [w . . v) is the longest interval from v inside [u . . v); we'll remove it */ 
if (b) b^next — b-^next^next; 
else v^arcs — v^arcs^next; 

I * the remaining job is to shorten the other maximal arcs in [u . . v) */ 
for (t = v — l; w > u; t — ) { 

for (a = t-^arcs,x = w; a; a = a-^next) 

if {a-^tip > u A a-^tip < x) x = a^tip.b = a; 
if (x < w) b^tip = w,w = x; /* [x . . t) is the longest interval from t */ 

} 

This code is used in section 28. 
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32. The denouement. Now we build a binary tree in the original graph F, by filling in some of the 
utility fields of F's vertices. If a node in the tree is labeled with x and with the interval [u, v), we represent 
it by x-left — u and x^right — v; the subtrees of this node are x-llink and x-rlink. The root of the whole 
tree is F-root. 

The rlink field happens to be the same as the count field, but this is no problem because the rlink is never 
changed or examined until after the count has been reset to zero for the last time. 

#define left x.V /* left endpoint of interval labeling this node */ 

#define right w.V /* right endpoint of interval labeling this node */ 

#define llink v.V /* left subtree of this node */ 

#define rlink z.V /* right subtree of this node */ 

#define root uu.V /* root node of the binary tree for this graph */ 

( Construct an irredundant subfamily of F with the cardinality of G 32 } = 

F-root = make-tree(F-vertices , F^vertices + F^n — 1); 
This code is used in section 23. 



33. With a little care we could maintain a stack within F itself, but it's easier to use recursion in C. Let's 
just hope the system programmers have given us a large enough runtime stack to work with. 
This subroutine is based on the trick explained in §20. 

(Subroutines 33) = 
Vertex * make-tree (t,w) 
Vertex *t, *w; 

{ 

register Vertex *u, *v, *x; 
register Arc *a; 

( Find a vertex x with N x F\ [t . . to) - N x S\ [t . . to) = 1 34 ); 

if (—*x) return A; /* F\[t..w) is empty */ 

( Find an interval [u. .v) such that x G [u . . v) C [t . . w) 35 }; 

x-left — u; 

x-right = v. 

x-llink = make-tree (t, x) ; 
x-rlink = make-tree (x + 1, to); 
return x; 

} 

See also section 37. 

This code is used in section 23. 
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34. A subtle bug is avoided here when we realize that a vertex might already be in the cleanup list when 
its count is zero. 

( Find a vertex x with N x F\ [t ..w)- N x S\ [t ..w) = 1 34 ) = 

! 

register int coverage — 0; /* coverage in F minus S */ 
Vertex * cleanup = w + 1; /* iu + 1 is a sentinel value */ 

for (v = w,x = A; v > t; v — ) { 

coverage —= v^count; /* now coverage refers to iV„_i */ 
for (a = v^arcs; a; a = a-*next) { 
u = a-'tip; 
if (u>t) { 

if (unlink = A) u-link = cleanup, cleanup = u; 

u-^count += a-len; /* the length is +1 for F, —1 for S */ 

coverage += a-len; 

} 

} 

if (coverage = 1) { 
x = v — 1; break; 

} 

} 

if (->£ A cleanup < w) fprintf (stderr , "This u can't u happen!\n"); 
while (cleanup < w) { 

v = cleanupAink; 

cleanup-* count = 0; 

cleanup^link = A; 

cleanup = v; 

} 

} 

This code is used in section 33. 

35. ( Find an interval [u . . v) such that x £ [u . . v) C [t . . w) 35 ) = 
for (v = u>; v > x; w — ) { 

for (a = v^arcs; a; a — a-*next) 
if (a-Zen > 0) { 
u = a-'tip; 

if (u<iAii>() goto done; 

} 

} 

done: 

This code is used in section 33. 

36. ( Print the results 36 ) = 

printf ( "Minimum u gener at ing u f amily : " ) ; 

for (v — G^vertices + 1; v < G^vertices + G-n; v++) 

for (a — v^arcs; a; a — a-*next) printf (" u [°/,s . . °/.s)" , a^tip~name,v~name); 
prm£/(''\nMaximum u irredundant u f amily : "); 
postorder -print (F^root ) ; 
printf ("\n"); 

This code is used in section 23. 
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37. There's just one subroutine to go. This is textbook stuff. 

(Subroutines 33} += 
void postorder-print (x) 
Vertex *x; 

{ 

if (x) { 

postorder-print (x-llink ) ; 
postorder_print (x^rlink ) ; 

printf ( " u°/°s [7.s . . 7«s ) " , x^name , x-left-name , x^right^name ) ; 

} 

} 
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38. Comments and extensions. The program just presented incorporates several refinements to the 
implementation sketched by Franzblau and Klcitman in [3], and the author hopes that readers will enjoy 
finding them in the code. The Stanford GraphBase provides convenient data structures, by means of which it 
was possible to make the program short and sweet. However, most of the key ideas (except for the make_tree 
procedure) can be found in [3]. 

39. Lubiw [10] discovered that the algorithm of Franzblau and Kleitman can be generalized so that it finds 
optimum irredundant subfamilies and generating families in an appropriate sense when the points of the 
underlying line have been given arbitrary nonnegative weights. It should be interesting and instructive to 
extend the program above so that it handles this more general problem. 

40. The introductory remarks in §2 mention the recent breakthrough by Frank and Jordan [2], who showed 
(among many other things) that Gyori's theorem can be extended to intervals on a circle as well as a line. 
Such a generalization was surprising because the size of a minimum generating set might be strictly larger 
than the size of a maximum irredundant subfamily of cyclic intervals. For example, the n intervals [k . . fc + 2) 
for < k < n on the ring of integers mod n are obviously redundant; if we leave out any one of them, the 
remaining n — 1 intervals will cover all n points. However, these cyclic intervals cannot be generated by 
fewer than n subintervals: No n — 1 subintervals of length 1 will do the job, and if [k . . k + 2) is one of the 
generators the remaining n — 1 intervals require n — 1 further generators because they are irredundant. 

Gyori's minimax principle is restored, however, if we change the definition of irredundant families. We can 
say that F is irredundant if each / G F has a distinguished element /* € /, such that whenever / and /' are 
distinct sets of F we have either /* ^ /' or fl ^ /. If F is irredundant in this sense, and if G generates F, it 
is not difficult to prove that \F\ < \G\: There is a <?/ <E G for each / G F, with the property that /* e <?/ C /; 
our new definition guarantees that gf ^ g/> when / ^ /'. 

According to this new definition, the intervals [k . . k + d) for < k < n, modulo n, are irredundant 
whenever n > 2{d — 1), because we can let [k . . k + d)* = k. Frank and Jordan showed that if F is any 
family of intervals modulo n with the property that each intersection / n /' of two of its members is either 
empty or a single interval, then the size of F's smallest generating family is the size of its largest irredundant 
subfamily under the new definition. 

41. For intervals on a line, Gyori [4] had already observed that both definitions of irredundancy are 
equivalent. Suppose a system of representatives /* E / is given for all / in some family F of intervals on 
a line, such that / ^ f implies /* ^ /' or ^ /. If we cannot arrange those intervals in a sequence / W, 
/( 2 ), /W such that i /W U • • • U for 1 < j < n, there must be some cycle of intervals 
such that /i 1 ^ £ f^ 2 \ /i 2 ' G • ■ ■ , f*™* £ where m > 2. Consider the shortest such cycle, and 
suppose /j 1 ^ = minjij We cannot have < for 1 < k < m, because /i™' G /W; let k > 1 be 

minimum such that is not strictly less than f* k \ Then must be strictly greater than f* k \ and 

we have < < /i fc ~ 1} . There is some j with 1< j < k and f^ 1} < < f^; since f^ l) G 
we have /i^ G a shorter cycle. This contradiction shows that no cycles exist. 

42. Frank and Jordan gave another criterion for irredundancy that works also for general families of 
intervals on a circle when large intervals might wrap around so that their intersection/ n /' consists of two 
disjoint intervals. In such cases they allow {/*,/*} C /', but only if /* and lie in different components 
of / n /'. For example, the intervals [k . . k + d) for < k < n, modulo n, are irredundant by this definition 
for all n > d. Once again the minimax theorem for generating families and irredundant subfamilies remains 
valid, in this extended sense. 

43. The algorithms presented by Frank and Jordan [2] for such problems require linear programming as a 
subroutine. Therefore it would be extremely interesting to find a purely combinatorial procedure, analogous 
to the algorithm of Franzblau and Kleitman, either for the wrap-restricted situation of §40 or for the more 
general setup of §42. 
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( Clean up all count fields 30 ) Used in section 28. 

(Compute G and S by the Franzblau/Klcitman algorithm 26} Used in section 23. 

( Construct an irredundant subfamily of F with the cardinality of G 32 } Used in section 23. 

( Find a vertex X with N x F\ [t . . W) - N x S\ [t . . w) = 1 34 ) Used in section 33. 

( Find an interval [u . . v) such that x G [u . . v) C [t . . w) 35 ) Used in section 33. 
( Make G a copy of F, retaining only leftward arcs 27 } Used in section 26. 

( Print the results 36 } Used in section 23. 

( Reduce all minimal bad intervals with right endpoint v and record them in S 28 ) Used in section 26. 
( Replace G by G[ [u . . v) 31 } Used in section 28. 

( Scan the command-line options and generate F 24 ) Used in section 23. 
( Set F to the mirror image of G 25 ) Used in section 24. 
( Subroutines 33, 37 } Used in section 23. 

( Update the counts for all intervals ending at u 29 } Used in section 28. 



